Modeling prosody for language identification on read and spontaneous speech

نویسندگان

  • Jean-Luc Rouas
  • Jérôme Farinas
  • François Pellegrino
  • Régine André-Obrecht
چکیده

This paper deals with an approach to Automatic Language Identification using only prosodic modeling. The actual approach for language identification focuses mainly on phonotactics because it gives the best results. We propose here to evaluate the relevance of prosodic information for language identification with read studio recording (previous experiment [1]) and spontaneous telephone speech. For read speech, experiments were performed on the five languages of the MULTEXT database [2]. On the MULTEXT corpus, our prosodic system achieved an identification rate of 79 % on the five languages discrimination task. For spontaneous speech, experiments are made on the ten languages of the OGI Multilingual telephone speech corpus [3]. On the OGI MLTS corpus, the results are given for languages pair discrimination tasks, and are compared with results from [4]. As a conclusion, if our prosodic system achieves good performance on read speech, it might not take into account the complexity of spontaneous speech prosody.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Common and Language Dependent Phonetic Differences Between Read and Spontaneous Speech in Russian, Finnish and Dutch

This preliminary study aims to reveal both common and language-specific phonetic differences between read and spontaneous speech in three typologically unrelated languages – Russian, Finnish, and Dutch. These languages differ in prosody, sound systems, speech styles, and means for conveying intonational meaning. Spontaneous speech was recorded from 5 to 8 speakers in each language. Transliterat...

متن کامل

Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech

In this paper, we present a comparative study between spontaneous speech and read Mandarin speech in the context of automatic speech recognition. We focus on analysis and modeling of prosodic features, based on a unique speech corpus that contains similar amounts of read and spontaneous speech data from the same group of speakers. Statistical analysis is carried out on tone contours and duratio...

متن کامل

Annotation Conventions and Corpus Design in the Investigation of Spontaneous Speech Prosody in Taiwanese

Understanding how intonational phrasing and focal prominence interact with lexically specified tone patterns is one of several problems in the investigation of speech processing in Chinese languages that cannot be addressed fully with read speech alone. This paper explores such problems for Taiwanese, one of the major languages in the southern Min dialect group. It outlines what is known about ...

متن کامل

Chinese Prosody and Prosodic Labeling of Spontaneous Speech

In this paper some prosodic research on read and spontaneous speech is introduced first, then the difference between read and spontaneous speech will be depicted, and finally the prosodic labeling system C-ToBI will be described.

متن کامل

Estimating speaker-specific intonation patterns using the linear alignment model

Modeling speaker-specific intonation is important in several areas, including speaker identification, verification, and imitation using text-to-speech synthesis. However the choice of the intonation model and the estimation of its parameters from spontaneous speech remains a challenge. We propose a way to estimate speaker-specific intonation parameters for a particular superpositional model, th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003